Detecting Misleading Information on COVID-19

Mohamed K Elhadad; Kin Fun Li; Fayez Gebali

doi:10.1109/ACCESS.2020.3022867

Detecting Misleading Information on COVID-19

IEEE Access. 2020 Sep 9:8:165201-165215. doi: 10.1109/ACCESS.2020.3022867. eCollection 2020.

Authors

Mohamed K Elhadad¹, Kin Fun Li¹, Fayez Gebali¹

Affiliation

¹ Department of Electrical and Computer EngineeringUniversity of Victoria Victoria V8W 2Y2 Canada.

Abstract

This article addresses the problem of detecting misleading information related to COVID-19. We propose a misleading-information detection model that relies on the World Health Organization, UNICEF, and the United Nations as sources of information, as well as epidemiological material collected from a range of fact-checking websites. Obtaining data from reliable sources should assure their validity. We use this collected ground-truth data to build a detection system that uses machine learning to identify misleading information. Ten machine learning algorithms, with seven feature extraction techniques, are used to construct a voting ensemble machine learning classifier. We perform 5-fold cross-validation to check the validity of the collected data and report the evaluation of twelve performance metrics. The evaluation results indicate the quality and validity of the collected ground-truth data and their effectiveness in constructing models to detect misleading information.

Keywords: COVID-19; Coronavirus; SARS-CoV-2; WHO; fake news detection; infodemic; misleading information; pandemic; social media; social networks; text classification; text mining; web mining.

This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.